Abstract. We prove that $c$-cyclically monotone transport plans $\pi$ optimize
the Monge-Kantorovich transportation problem under an additional measurability
condition. This measurability condition is always satisfied for finitely
valued, lower semi-continuous cost functions. In particular, this yields a
positive answer to Problem 2.25 in C. Villani's book. We emphasize that we do
not need any regularity conditions as were imposed in the previous literature.



We consider the Monge-Kantorovich optimal transport problem $(\mu, \nu, c)$ for Borel
measures $\mu, \nu$ on polish spaces $X, Y$ and a lower semi-continuous cost function
$c : X \times Y \to \mathbb{R}_{\ge 0} \cup \{+\infty\}$ (see C. Villani's beautiful book [8] for all
necessary details): we define the set of transport plans $\Pi(\mu, \nu)$ as the set of
probability measures $\pi$ with marginal $\mu$ on $X$ and marginal $\nu$ on $Y$, respectively.
Furthermore, we define $\Phi(\mu, \nu)$ as the set of pairs $(\varphi, \psi)$ of Borel functions
$\varphi : X \to \mathbb{R} \cup \{-\infty\}$ and $\psi : Y \to \mathbb{R} \cup \{-\infty\}$ with $\varphi \in L^1(\mu)$ and
$\psi \in L^1(\nu)$, such that

\[ \varphi(x) + \psi(y) \le c(x, y) \]

for all $(x, y) \in X \times Y$. The Monge-Kantorovich problem is to minimize the cost

\[ I(\pi) = \int_{X \times Y} c(x, y) \, \pi(dx, dy) \]

for transport plans $\pi \in \Pi(\mu, \nu)$. Dually to the Monge-Kantorovich problem, we
maximize

\[ J(\varphi, \psi) = \int_X \varphi(x) \, \mu(dx) + \int_Y \psi(y) \, \nu(dy) \]

over $(\varphi, \psi) \in \Phi(\mu, \nu)$.

It is well-known that under the previous conditions, the equality

\[ \inf_{\pi \in \Pi(\mu, \nu)} I(\pi) = \sup_{(\varphi, \psi) \in \Phi(\mu, \nu)} J(\varphi, \psi) \]

holds true, and the infimum is in fact a minimum, if $\inf_{\Pi(\mu, \nu)} I(\pi) < \infty$
(see [8], Th. 1.3). We know furthermore that optimizers $\pi$ with $I(\pi) < \infty$ are
concentrated on $c$-monotone (see the following definition) Borel sets
$\Gamma \subset X \times Y$. These assertions are proved for instance in [8, ch. 2] or in
[1, Th. 3.2].

Definition 1. A set $\Gamma \subset X \times Y$ is called $c$-monotone if for all $n \ge 1$, for all
points $(x_i, y_i) \in \Gamma$, $i = 1, \dots, n$, and all permutations $\sigma \in \mathcal{S}_n$

\[ \sum_{i=1}^{n} c(x_i, y_i) \le \sum_{i=1}^{n} c(x_i, y_{\sigma(i)}) \]

holds true. A transport plan $\pi \in \Pi(\mu, \nu)$ is called $c$-monotone if there is a
$c$-monotone Borel set $\Gamma \subset X \times Y$ on which it is concentrated, i.e. $\pi(\Gamma) = 1$.
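
For a finite set of pairs, the inequality in Definition 1 can be tested by brute force. The following is a minimal sketch, not part of the original argument: the helper name `is_c_monotone` and the tolerance `tol` are hypothetical, and the cost is assumed to be finite on the given pairs. It compares the cost of the given pairing with every permutation of the second coordinates; since a permutation may fix some indices, this also covers subfamilies of distinct points.

```python
from itertools import permutations

def is_c_monotone(pairs, cost, tol=1e-12):
    """Brute-force test of the inequality in Definition 1 on a finite set.

    Assumes cost(x, y) is finite on every given pair (hypothetical helper).
    """
    pairs = list(pairs)
    n = len(pairs)
    xs = [x for x, _ in pairs]
    ys = [y for _, y in pairs]
    base = sum(cost(x, y) for x, y in pairs)
    for sigma in permutations(range(n)):
        permuted = sum(cost(xs[i], ys[sigma[i]]) for i in range(n))
        if base > permuted + tol:
            return False  # rearranging the targets is strictly cheaper
    return True

def quadratic(x, y):
    return (x - y) ** 2

increasing = [(0.0, 0.0), (1.0, 2.0), (3.0, 5.0)]  # order-preserving pairing
decreasing = [(0.0, 5.0), (1.0, 2.0), (3.0, 0.0)]  # order-reversing pairing
print(is_c_monotone(increasing, quadratic))
print(is_c_monotone(decreasing, quadratic))
```

The search runs over all $n!$ permutations, so it is only meant for very small examples.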

In [8] it was posed as an open problem whether $c$-monotone transport plans
$\pi \in \Pi(\mu, \nu)$ are necessarily optimizers of the Monge-Kantorovich problem (see
Problem 2.25 in [8] and the references therein). Due to [1] it is known that this
conjecture holds true under certain moment conditions on the probability measures
$\mu, \nu$ with respect to the cost function $c$.

Crucial for these considerations is Rüschendorf's Theorem (see [5] and [8, ch. 2]),
which relates $c$-concave functions and $c$-monotone sets $\Gamma$. Let a function
$\psi : Y \to \mathbb{R} \cup \{-\infty\}$ and a lower semi-continuous cost function
$c : X \times Y \to \mathbb{R}_{\ge 0} \cup \{+\infty\}$ be given. Then $\psi$ is called $c$-concave on $Y$ if there is a
function $\varphi : X \to \mathbb{R} \cup \{-\infty\}$ such that

\[ \psi(y) = \inf_{x \in X} \bigl( c(x, y) - \varphi(x) \bigr) \]

for $y \in Y$. Accordingly we define $c$-concave functions on $X$.
Let $\varphi : X \to \mathbb{R} \cup \{-\infty\}$ be a function not identical to $-\infty$; then

\[ \varphi^c(y) = \inf_{x \in X} \bigl( c(x, y) - \varphi(x) \bigr) \tag{1} \]

is called the $c$-transform of $\varphi$, which is obviously a $c$-concave function on $Y$. We have
that $\varphi$ is $c$-concave on $X$ if and only if $\varphi = \varphi^{cc}$. The $c$-superdifferential
$\partial^c \varphi \subset X \times Y$ of a $c$-concave function $\varphi$ is defined as the set of all pairs
$(x, y) \in X \times Y$ such that for all $z \in X$,

\[ \varphi(z) \le \varphi(x) + \bigl( c(z, y) - c(x, y) \bigr). \]

Rüschendorf's Theorem finally states that a set $\Gamma \subset X \times Y$ is $c$-monotone if and
only if there is a $c$-concave function $\varphi$ such that $\Gamma \subset \partial^c \varphi$ (see [8, ch. 2] and [5] for
further details and references).
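
On finite grids the $c$-transform (1) is a plain minimum, which makes the basic identities easy to observe. The sketch below is illustrative only: the grids `X`, `Y`, the quadratic cost and the values of `phi` are arbitrary assumptions, and all values are taken finite to avoid arithmetic with $-\infty$. It computes $\varphi^c$, $\varphi^{cc}$ and $\varphi^{ccc}$.

```python
def c_transform(phi, cost, domain, codomain):
    """c-transform on finite grids: y -> min over x in domain of cost(x, y) - phi[x]."""
    return {y: min(cost(x, y) - phi[x] for x in domain) for y in codomain}

def cost(x, y):
    return (x - y) ** 2

# Arbitrary illustrative data (assumptions, not taken from the paper).
X = [0, 1, 2, 3]
Y = [0, 1, 2, 3]
phi = {0: 0, 1: 3, 2: -1, 3: 2}

phi_c = c_transform(phi, cost, X, Y)                        # function on Y
phi_cc = c_transform(phi_c, lambda y, x: cost(x, y), Y, X)  # function on X
phi_ccc = c_transform(phi_cc, cost, X, Y)                   # function on Y again

# phi(x) + phi^c(y) <= c(x, y) holds on the whole grid by construction.
print(all(phi[x] + phi_c[y] <= cost(x, y) for x in X for y in Y))
# phi^cc dominates phi; the two agree exactly when phi is c-concave.
print(phi_cc, phi_cc == phi)
# A third transform gives nothing new: phi^ccc = phi^c.
print(phi_ccc == phi_c)
```

In this example `phi` is not $c$-concave on the grid, so `phi_cc` differs from `phi`, while `phi_c` is $c$-concave by definition and is reproduced by the double transform.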

We cite the Theorem from [1], which inspired our subsequent considerations.

Theorem 1. Let $X, Y$ be polish spaces and $\mu, \nu$ be Borel probability measures
thereon. Assume that $c$ is a finitely valued, lower semi-continuous cost function
and that $\pi$ is $c$-monotone. Furthermore we assume that

\[ \nu\Bigl( \Bigl\{ y : \int_X c(x, y) \, \mu(dx) < \infty \Bigr\} \Bigr) > 0, \]
\[ \mu\Bigl( \Bigl\{ x : \int_Y c(x, y) \, \nu(dy) < \infty \Bigr\} \Bigr) > 0. \]

Then $\pi$ is an optimizer and there exists a dual optimizer $(\varphi, \psi) \in L^1(\mu) \times L^1(\nu)$,
where $\varphi$ is a $c$-concave Borel function, $\psi$ is a Borel function, $\psi = \varphi^c$ almost surely
with respect to $\nu$ and

\[ \varphi(x) + \psi(y) \le c(x, y) \]

for $(x, y) \in X \times Y$, where equality holds $\pi$-almost surely.

We shall prove the same assertion without assuming that $c$ is finitely valued and
without any assumptions on the existence of moments of $c$. Here we do assume
a measurability condition related to $c$-monotone transport plans, namely the
existence of a Borel-measurable "hull" for a sequence of dual optimizers. We shall
prove (Proposition 1 below) that this measurability condition always holds true for
finitely valued, lower semi-continuous cost functions $c$, thus covering in particular
the setting of Problem 2.25 in [8].

Definition 2. Given polish spaces $X, Y$ and a lower semi-continuous cost function
$c : X \times Y \to \mathbb{R}_{\ge 0} \cup \{+\infty\}$, we call a Borel set $\Gamma \subset X \times Y$ strongly $c$-monotone if
there exist Borel functions $\varphi : X \to \mathbb{R} \cup \{-\infty\}$ and $\psi : Y \to \mathbb{R} \cup \{-\infty\}$ such that

\[ \varphi(x) + \psi(y) \le c(x, y) \]

for all $(x, y) \in X \times Y$, and equality holds if $(x, y) \in \Gamma$. We call a transport plan $\pi$
strongly $c$-monotone if there exists a strongly $c$-monotone Borel set $\Gamma \subset X \times Y$ on
which $\pi$ is concentrated, i.e. $\pi(\Gamma) = 1$.

Remark 1. A seemingly innocent, but important point is the Borel measurability
of $\varphi$ and $\psi$.

Remark 2. Notice that we assume the Borel-measurable functions $\varphi, \psi$ neither
to be $c$-concave nor to be conjugate.

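Remark 3. A strongly $c$-monotone set $\Gamma \subset X \times Y$ is $c$-monotone, since

\[ \sum_{i=1}^{n} c(x_i, y_i) = \sum_{i=1}^{n} \bigl( \varphi(x_i) + \psi(y_i) \bigr)
   = \sum_{i=1}^{n} \bigl( \varphi(x_i) + \psi(y_{\sigma(i)}) \bigr)
   \le \sum_{i=1}^{n} c(x_i, y_{\sigma(i)}) \]

for all $(x_i, y_i)_{i=1,\dots,n} \subset \Gamma$ and permutations $\sigma \in \mathcal{S}_n$, for $n \ge 1$.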

The following Proposition, which is based on the proof of Theorem 3.2 in [1],
shows that all $c$-monotone transport plans are strongly $c$-monotone
provided that $c$ is finitely valued.

Proposition 1. Let $X, Y$ be polish spaces, $c : X \times Y \to \mathbb{R}_{\ge 0}$ a finitely valued,
lower semi-continuous cost function and $\pi$ a $c$-monotone transport plan. Then there
exist Borel-measurable functions $\varphi, \psi$ on $X, Y$, respectively, taking values in
$\mathbb{R} \cup \{-\infty\}$, such that $\varphi$ is $c$-concave, the $c$-superdifferential $\partial^c \varphi$ contains a set
on which $\pi$ is concentrated, $\psi$ coincides $\nu$-almost surely with $\varphi^c$, the inequality

\[ \varphi(x) + \psi(y) \le c(x, y) \]

holds true for all $(x, y) \in X \times Y$, and equality holds true $\pi$-almost surely. In
particular $\pi$ is strongly $c$-monotone.

Proof. In order to apply a construction of [1] we have to show that there is a
$c$-monotone Borel set $\Gamma = \bigcup_{k \ge 1} \Gamma_k$ with $\Gamma_k$ compact, $c|_{\Gamma_k}$ continuous for $k \ge 1$ and
$\pi(\Gamma) = 1$. Take any $c$-monotone Borel set $\Gamma'$ with $\pi(\Gamma') = 1$, which exists by
assumption. By Egorov's Theorem and the inner regularity on polish spaces of the
transport plan $\pi$ (see [4] for both results), we know that there are compact subsets
$L_k \subset X \times Y$ such that $\pi(X \times Y \setminus L_k) < \frac{1}{k}$ and $c|_{L_k}$ is continuous for
$k \ge 1$. Consequently we can define compact subsets
$\Gamma_k := \overline{\Gamma' \cap L_k} \subset L_k \subset X \times Y$ for $k \ge 1$. In particular $c|_{\Gamma_k}$ is continuous for
$k \ge 1$ and the Borel set $\Gamma := \bigcup_{k \ge 1} \Gamma_k$ satisfies
$\Gamma \supset \Gamma' \cap \bigcup_{k \ge 1} L_k$ by construction, so that $\pi(\Gamma) = 1$. Since $c|_{\Gamma_k}$ is
continuous and $\Gamma' \cap L_k$ is $c$-monotone by assumption, the $c$-monotonicity extends to
the closure $\Gamma_k = \overline{\Gamma' \cap L_k}$ for $k \ge 1$.
A union of $c$-monotone sets is $c$-monotone, so $\Gamma$ is $c$-monotone. Whence we have the
claim and can apply [1, Th. 3.2] (step 1 of the proof) in order to construct, in the
spirit of Rüschendorf's Theorem, a Borel-measurable function $\varphi : X \to \mathbb{R} \cup \{-\infty\}$
with $\Gamma \subset \partial^c \varphi$.

We then define $\varphi^c$ via (1) and obtain by applying again [1, Th. 3.2] (step 2 of the
proof) that $\varphi^c$ is $\nu$-measurable. Setting the values of $\varphi^c$ to $-\infty$ on a $\nu$-negligible
set we can define a Borel-measurable function $\psi$ on $Y$ such that

\[ \varphi(x) + \psi(y) \le c(x, y) \]

holds true for all $(x, y) \in X \times Y$. If $(x, y) \in \Gamma \subset \partial^c \varphi$, we know from the definition
of the superdifferential of $\varphi$ that

\[ \varphi(z) \le \varphi(x) + \bigl( c(z, y) - c(x, y) \bigr) \]

for all $z \in X$, hence

\[ c(x, y) - \varphi(x) \le c(z, y) - \varphi(z) \]

for all $z$, so

\[ c(x, y) - \varphi(x) \le \psi(y) \]

for $(x, y) \in \Gamma$. Hence equality in

\[ \varphi(x) + \psi(y) \le c(x, y) \]

holds true $\pi$-almost surely, since $\pi(\Gamma) = 1$. □

Next we show that optimal transport plans are strongly $c$-monotone
provided that $c$ is $\mu \otimes \nu$-almost surely finitely valued. This result suggests working
with strongly $c$-monotone transport plans instead of $c$-monotone ones. For the
proof we need the following Lemma, which generalizes results in [5] and [7] (see in
particular the references therein).

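Lemma 1. Let $(\Omega_i, \mathcal{F}_i, \mu_i)$, $i = 1, 2$, be probability spaces and let
$\varphi_n : \Omega_1 \to \mathbb{R}$ and $\psi_n : \Omega_2 \to \mathbb{R}$ be measurable functions for $n \ge 1$. Assume that

\[ \xi_n(\omega_1, \omega_2) := \varphi_n(\omega_1) + \psi_n(\omega_2) \]

converges in $\mu_1 \otimes \mu_2$-probability to $\xi : \Omega_1 \times \Omega_2 \to \mathbb{R} \cup \{-\infty\}$. Then there exist
real numbers $(r_n)_{n \ge 1}$ and measurable functions $\varphi : \Omega_1 \to \mathbb{R} \cup \{-\infty\}$,
$\psi : \Omega_2 \to \mathbb{R} \cup \{-\infty\}$ such that, along a fixed subsequence, $(\varphi_n + r_n)_{n \ge 1}$ converges in
probability to $\varphi$ and $(\psi_n - r_n)_{n \ge 1}$ to $\psi$. Furthermore,

\[ \xi(\omega_1, \omega_2) = \varphi(\omega_1) + \psi(\omega_2) \]

$\mu_1 \otimes \mu_2$-almost surely.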

Proof. We choose a complete, bounded metric $d$ on $\mathbb{R} \cup \{-\infty\}$. Then $\xi_n \to \xi$ in
probability as $n \to \infty$ is equivalent to $E(d(\xi_n, \xi)) \to 0$ as $n \to \infty$. By Fubini's
theorem the real-valued function

\[ \omega_2 \mapsto \int_{\Omega_1} d\bigl( \xi_n(\omega_1, \omega_2), \xi(\omega_1, \omega_2) \bigr) \, \mu_1(d\omega_1) \]

is $\mu_2$-almost surely well-defined and converges in $\mu_2$-probability to $0$ as $n \to \infty$ by
assumption.

We shall now distinguish two cases: either $\xi$ is finitely valued with positive
probability, or $\xi = -\infty$ $\mu_1 \otimes \mu_2$-almost surely. First we assume that there is
$A \subset \Omega_1 \times \Omega_2$ such that $\xi|_A > -\infty$ and $(\mu_1 \otimes \mu_2)(A) > 0$. Then there is
$\bar\omega_2 \in \Omega_2$ such that

\[ \xi_n(\omega_1, \bar\omega_2) = \varphi_n(\omega_1) + \psi_n(\bar\omega_2) \to \varphi(\omega_1) := \xi(\omega_1, \bar\omega_2) \]

converges in $\mu_1$-probability on $\Omega_1$ as $n \to \infty$, and the function $\varphi : \Omega_1 \to \mathbb{R} \cup \{-\infty\}$
is measurable with $\varphi > -\infty$ on a set of positive $\mu_1$-probability. Here we possibly
have to pass to a subsequence to guarantee that there is $\bar\omega_2$ where $\xi_n(\cdot, \bar\omega_2)$ converges
in probability. We assume from now on that we have already passed to such a
subsequence. We then define $r_n := \psi_n(\bar\omega_2)$ for $n \ge 1$. Furthermore we obtain that
the sequence

\[ \psi_n - r_n = \xi_n - (\varphi_n + r_n) \]

converges in $\mu_2$-probability on $\Omega_2$ as $n \to \infty$ to a function $\psi : \Omega_2 \to \mathbb{R} \cup \{-\infty\}$.
Indeed, the right hand side does not depend on the first variable $\omega_1$ by construction.
We choose a measurable subset $A_1 \subset \Omega_1$ of positive $\mu_1$-probability with $\varphi|_{A_1} > -\infty$.
The sequence $(\xi_n)_{n \ge 1}$ converges in probability to $\xi$ and the sequence $(\varphi_n + r_n)_{n \ge 1}$
converges in probability to $\varphi$ on $A_1$, so the difference converges in probability on
$A_1 \times \Omega_2$ to a limit $\psi$, which does not depend on the first variable. Hence in particular

\[ \xi(\omega_1, \omega_2) = \varphi(\omega_1) + \psi(\omega_2) \]

$\mu_1 \otimes \mu_2$-almost surely.

Now we assume that $\xi$ is almost surely equal to $-\infty$. We know that, for each
fixed $0 < \alpha < 1$, the $\alpha$-quantiles

\[ q_n^1(\alpha) := \inf\{ x ;\ \mu_1(\varphi_n > x) < \alpha \}, \]
\[ q_n^2(\alpha) := \inf\{ y ;\ \mu_2(\psi_n > y) < \alpha \} \]

have the property

\[ \lim_{n \to \infty} \bigl( q_n^1(\alpha) + q_n^2(\alpha) \bigr) = -\infty \]

by assumption. Indeed, otherwise there is a subsequence $(n_k)_{k \ge 1}$ and a lower bound
$M > -\infty$ such that $q_{n_k}^1(\alpha) + q_{n_k}^2(\alpha) \ge M$ for all $k \ge 1$. Hence $\varphi_{n_k} + \psi_{n_k} \ge M$
with $\mu_1 \otimes \mu_2$-probability greater than $\alpha^2$ for $k \ge 1$. However, this contradicts the
convergence in probability to $-\infty$. By choosing a sequence $(\alpha_n)_{n \ge 1}$ with $\alpha_n \downarrow 0$
slowly enough as $n \to \infty$, we can maintain this property, i.e.

\[ \lim_{n \to \infty} \bigl( q_n^1(\alpha_n) + q_n^2(\alpha_n) \bigr) = -\infty. \]

We define now

\[ z_n := \tfrac{1}{2} \bigl( q_n^1(\alpha_n) + q_n^2(\alpha_n) \bigr), \]
\[ r_n := q_n^2(\alpha_n) - z_n = -\bigl( q_n^1(\alpha_n) - z_n \bigr). \]

We have $\lim_{n \to \infty} z_n = -\infty$ and hence $\varphi_n + r_n \to -\infty$ and $\psi_n - r_n \to -\infty$ in
the respective probabilities as $n \to \infty$ by construction. □

In the proof of the subsequent proposition we shall apply a Komlós-type result,
which we cite here from [3, Lemma A1.1] for the sake of completeness.

Lemma 2. Let $(f_n)_{n \ge 1}$ be a sequence of $\mathbb{R}_{\ge 0} \cup \{+\infty\}$-valued random variables on
a probability space $(\Omega, \mathcal{F}, P)$. Then there is a sequence $g_n \in \operatorname{conv}(f_n, f_{n+1}, \dots)$, the
convex hull of $f_n, f_{n+1}, \dots$, for $n \ge 1$, which converges almost surely to a random
variable $g$ taking values in $\mathbb{R}_{\ge 0} \cup \{+\infty\}$.

The interesting feature of this result is that we do not need any integrability
assumption on the sequence $(f_n)_{n \ge 1}$ to obtain almost sure convergence of
convex combinations. It suffices to assume non-negativity of the functions $f_n$. This
situation is related to the setting of [2], where it is shown that a Bipolar Theorem
can be formulated for the non-locally-convex space $L^0$, provided that one restricts
to positive functions.

Proposition 2. Let $c : X \times Y \to \mathbb{R}_{\ge 0} \cup \{+\infty\}$ be a lower semi-continuous cost
function on polish spaces $X, Y$. Let $\mu$ on $X$ and $\nu$ on $Y$ be Borel probability measures
and $\pi \in \Pi(\mu, \nu)$ an optimizer with $I(\pi) < \infty$. We assume that $c$ is $\mu \otimes \nu$-almost
surely finitely valued. Then there exist Borel functions $\varphi, \psi$ on $X, Y$, respectively,
taking values in $\mathbb{R} \cup \{-\infty\}$, such that

\[ \varphi(x) + \psi(y) \le c(x, y) \]

for $x \in X$ and $y \in Y$, and $\varphi(x) + \psi(y) = c(x, y)$ almost surely with respect to $\pi$. In
particular the transport plan $\pi$ is strongly $c$-monotone.

Proof. By [1], Th. 3.1, we know that there is a maximizing sequence $(\varphi_n, \psi_n)$ of
bounded Borel functions on $X, Y$, respectively, such that

\[ \lim_{n \to \infty} E_\pi(\xi_n) = \lim_{n \to \infty} E_{\mu \otimes \nu}(\xi_n) = E_\pi(c) = I(\pi), \]

where $\xi_n(x, y) = \varphi_n(x) + \psi_n(y) \le c(x, y)$ for all $(x, y) \in X \times Y$ and the convergence
of $(E_{\mu \otimes \nu}(\xi_n))_{n \ge 0}$ is monotonically increasing. By passing to convex combinations
(apply Lemma 2 above with $f_n = c - \xi_n$ for $n \ge 0$) we may assume that $\xi_n$
converges $\mu \otimes \nu$-almost surely to a Borel function $\xi$ taking values in $\mathbb{R} \cup \{-\infty\}$. By
Lemma 1 we know that

\[ \xi(x, y) = \varphi(x) + \psi(y), \]

with functions $\varphi$ and $\psi$ which are $\mu$- and $\nu$-almost surely defined, respectively.
Furthermore, by passing to a subsequence if necessary, there is a sequence of real
numbers $(r_n)_{n \ge 1}$ such that $\varphi_n + r_n \to \varphi$ and $\psi_n - r_n \to \psi$ as $n \to \infty$ almost surely
with respect to $\mu$ and $\nu$, respectively. Choosing appropriate nullsets in $X$ and $Y$,
we can redefine $\varphi, \varphi_n$ and $\psi, \psi_n$ on these nullsets by $-\infty$, such that $\varphi_n + r_n \to \varphi$
and $\psi_n - r_n \to \psi$ surely, without violating the now sure inequality

\[ \xi(x, y) = \varphi(x) + \psi(y) \le c(x, y). \]

Obviously $E_\pi(|c - \xi_n|) = E_\pi(c - \xi_n) \to 0$ as $n \to \infty$ by assumption, hence
$\varphi(x) + \psi(y) = c(x, y)$ almost surely with respect to $\pi$. □

Our main Theorem below states that strongly $c$-monotone transport plans are
always optimal. For the sake of clarity we shall formulate an elementary lemma on
monotone convergence of truncations, whose proof is obvious.

Lemma 3. Let $a, b$ be real numbers. We define

\[ a_n := (-n \vee a) \wedge n, \]
\[ b_n := (-n \vee b) \wedge n, \]
\[ \xi_n := a_n + b_n. \]

Then we have $\xi_0 = 0$ and $(\xi_n)_{n \ge 0}$ converges monotonically to $a + b$, i.e. if $a + b \ge 0$
the sequence $(\xi_n)_{n \ge 0}$ increases to $a + b$ and if $a + b \le 0$ the sequence decreases to
$a + b$.
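
The monotonicity in Lemma 3 is easy to observe numerically. The following sketch is illustrative only; the helper names `truncate` and `truncated_sums` are hypothetical. It tabulates $\xi_n = a_n + b_n$ for two sample pairs, one with $a + b \ge 0$ and one with $a + b \le 0$.

```python
def truncate(value, n):
    """Return (-n v value) ^ n, i.e. value clipped to the interval [-n, n]."""
    return min(max(-n, value), n)

def truncated_sums(a, b, n_max):
    """Return [xi_0, ..., xi_{n_max}] with xi_n = a_n + b_n as in Lemma 3."""
    return [truncate(a, n) + truncate(b, n) for n in range(n_max + 1)]

up = truncated_sums(5, -3, 8)        # a + b = 2 >= 0: non-decreasing sequence
down = truncated_sums(-7.5, 2, 10)   # a + b = -5.5 <= 0: non-increasing sequence
print(up)
print(down)
```

Both sequences start at $\xi_0 = 0$ and become constant once $n$ exceeds $\max(|a|, |b|)$. The same `truncate` also accepts `float("-inf")`, the case needed when the truncation is applied to functions with values in $\mathbb{R} \cup \{-\infty\}$.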

Theorem 2. Let $c : X \times Y \to \mathbb{R}_{\ge 0} \cup \{+\infty\}$ be a lower semi-continuous cost
function on polish spaces $X, Y$. Given Borel probability measures $\mu$ on $X$, $\nu$ on $Y$
and a strongly $c$-monotone transport plan $\pi \in \Pi(\mu, \nu)$, then $\pi$ is an optimizer of
the Monge-Kantorovich problem.

Proof. We assume that there is a transport plan $\pi_0$ with finite cost $I(\pi_0) < \infty$. We
aim to show that $I(\pi) \le I(\pi_0)$. By the assumption of strong $c$-monotonicity of $\pi$
there exist Borel-measurable functions $\varphi, \psi$ on $X, Y$, respectively, taking values in
$\mathbb{R} \cup \{-\infty\}$, such that for all $(x, y) \in X \times Y$

\[ \varphi(x) + \psi(y) \le c(x, y), \]

where equality holds $\pi$-almost surely. We write $\xi(x, y) := \varphi(x) + \psi(y)$ and define

\[ \varphi_n(x) := (-n \vee \varphi(x)) \wedge n, \]
\[ \psi_n(y) := (-n \vee \psi(y)) \wedge n, \]
\[ \xi_n(x, y) := \varphi_n(x) + \psi_n(y), \]

for $(x, y) \in X \times Y$ and $n \ge 0$. Due to the previous Lemma observe that $\xi_n \uparrow \xi$ on
$\{\xi \ge 0\}$ and $\xi_n \downarrow \xi$ on $\{\xi \le 0\}$, as $n \to \infty$.

Additionally, $E_\pi(\xi) = E_\pi(c)$ exists, taking possibly the value $+\infty$, since equality
$\xi = c$ holds $\pi$-almost surely and $c \ge 0$. The integral of $\xi$ with respect to $\pi_0$ exists,
too, as

\[ E_{\pi_0}(\xi_+) \le E_{\pi_0}(c) < \infty. \]

Note that $E_{\pi_0}(\xi)$ possibly equals $-\infty$. By the assumption on equal marginals of $\pi$
and $\pi_0$ we obtain

\[ E_\pi(\xi_n) = E_\pi(\varphi_n) + E_\pi(\psi_n) = E_{\pi_0}(\varphi_n) + E_{\pi_0}(\psi_n) = E_{\pi_0}(\xi_n) \]

for $n \ge 0$, hence

\[ E_\pi\bigl( \xi_n 1_{\{\xi \ge 0\}} + \xi_n 1_{\{\xi < 0\}} \bigr) = E_{\pi_0}\bigl( \xi_n 1_{\{\xi \ge 0\}} + \xi_n 1_{\{\xi < 0\}} \bigr). \]

By our previous considerations we can pass to the limits and obtain $E_\pi(\xi) = E_{\pi_0}(\xi)$.
Indeed the limits are monotone on $\{\xi \ge 0\}$ and $\{\xi < 0\}$, the expectation with
respect to $\pi$ of $\xi_-$ is finite, namely $E_\pi(\xi_-) = 0$, and the expectation with respect
to $\pi_0$ of $\xi_+$ is finite, $E_{\pi_0}(\xi_+) < \infty$. Hence the limits of $E_\pi(\xi_n) = E_{\pi_0}(\xi_n)$ exist as
$n \to \infty$ and are equal. Consequently $I(\pi) = E_\pi(\xi) = E_{\pi_0}(\xi) \le I(\pi_0)$. □
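
The chain $I(\pi) = E_\pi(\xi) = E_{\pi_0}(\xi) \le I(\pi_0)$ can be checked by hand on a finite example. The sketch below rests on assumptions chosen purely for illustration: three points on each side, uniform marginals, the cost $|x - y|$, and the potentials $\varphi(x) = -x$, $\psi(y) = y$. Exact rational arithmetic avoids rounding issues.

```python
from fractions import Fraction

# Illustrative data (assumptions, not taken from the paper).
xs = [0, 1, 3]
ys = [1, 2, 5]

def cost(x, y):
    return abs(x - y)

def xi(x, y):
    # xi = phi(x) + psi(y) with phi(x) = -x and psi(y) = y
    return -x + y

def expectation(plan, f):
    """Integrate f against a plan given as a matrix of weights on xs x ys."""
    return sum(plan[i][j] * f(xs[i], ys[j]) for i in range(3) for j in range(3))

third, ninth = Fraction(1, 3), Fraction(1, 9)
# pi pairs xs[i] with ys[i]; pi0 is the product plan; both have uniform marginals.
pi = [[third if i == j else Fraction(0) for j in range(3)] for i in range(3)]
pi0 = [[ninth] * 3 for _ in range(3)]

# xi <= c everywhere, with equality on the support of pi.
print(all(xi(x, y) <= cost(x, y) for x in xs for y in ys))
print(all(xi(xs[i], ys[i]) == cost(xs[i], ys[i]) for i in range(3)))
# I(pi) = E_pi(xi) = E_pi0(xi) <= I(pi0)
print(expectation(pi, cost), expectation(pi, xi),
      expectation(pi0, xi), expectation(pi0, cost))
```

The middle equality only uses that $\pi$ and $\pi_0$ share their marginals, which is the step that the truncation argument of the proof extends to potentials that are not integrable.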

The following Corollary answers Problem 2.25 of [8] pertaining to the quadratic 
cost function affirmatively. In fact, all finite, lower semi-continuous cost functions 
are covered and no additional assumptions on integrability or measurability are 
necessary. 

Corollary 1. Let $c : X \times Y \to \mathbb{R}_{\ge 0}$ be a finitely valued, lower semi-continuous
cost function on polish spaces $X, Y$. Given Borel probability measures $\mu$ on $X$, $\nu$
on $Y$ and a $c$-monotone transport plan $\pi \in \Pi(\mu, \nu)$, then $\pi$ is an optimizer of the
Monge-Kantorovich problem.

Proof. Due to Proposition 1, for a finitely valued, lower semi-continuous cost
function $c$, a $c$-monotone transport plan $\pi$ is strongly $c$-monotone. Hence Theorem 2
applies. □

The subsequent example, which is taken from [1], Example 3.5, shows that there
are $c$-monotone transport plans which are not strongly $c$-monotone.

Example 1. This example is just a re-interpretation of the illuminating Example
3.5 in [1]. Take $X = Y = T^1 = \{\exp(2\pi i r),\ 0 \le r < 1\} \subset \mathbb{C}$, the 1-dimensional
torus with uniform distribution $\mu = \nu$, and choose $\alpha \in T^1$ so that $\{\alpha^n, n \ge 0\}$ is
dense in $T^1$, i.e. $\alpha = \exp(2\pi i r)$ with $r$ an irrational number. We define a lower
semi-continuous cost function $c : X \times Y \to \mathbb{R}_{\ge 0} \cup \{+\infty\}$ via

\[ c(x, y) = \begin{cases} 1 & \text{if } x = y, \\ 2 & \text{if } y = x\alpha, \\ +\infty & \text{elsewhere.} \end{cases} \]

Then there are two disjoint $c$-monotone subsets of $X \times Y$, namely

\[ \Gamma_1 = \{(x, x),\ x \in T^1\}, \]
\[ \Gamma_2 = \{(x, x\alpha),\ x \in T^1\}. \]

For the first set strong $c$-monotonicity is clear with $\varphi_1(\exp(2\pi i r)) = r$ for $r \in [0, 1[$,
and $\psi_1(\exp(2\pi i r)) = \inf_x \bigl( c(x, \exp(2\pi i r)) - \varphi_1(x) \bigr) = 1 - r$ for $r \in [0, 1[$. Then we
obtain that $\varphi_1(x) + \psi_1(y) \le c(x, y)$ with equality if $x = y$.

The second set $\Gamma_2$ is $c$-monotone (see [1], Example 3.5), but not strongly
$c$-monotone (this follows from non-optimality, but we shall also show it directly).

Take $n \ge 1$ and $(x_i, x_i \alpha)$ for $i = 1, \dots, n$. Then for all $\sigma \in \mathcal{S}_n$,

\[ \sum_{i=1}^{n} c(x_i, x_i \alpha) \le \sum_{i=1}^{n} c(x_i, x_{\sigma(i)} \alpha). \]

Indeed, otherwise there is $\sigma \in \mathcal{S}_n$ and at least one $x_i \in T^1$ such that
$c(x_i, x_{\sigma(i)} \alpha) = 1$ and therefore

\[ x_i = x_{\sigma(i)} \alpha. \]

Iteration of this equation yields $\alpha^m = 1$ for some $m \ge 1$, a contradiction showing
the $c$-monotonicity of $\Gamma_2$.

We shall now show that $\Gamma_2$ fails to be strongly $c$-monotone. In fact, we shall
show the stronger assertion that, for every Borel set $B \subset T^1$ with $\mu(B) > 0$,
the set

\[ \Gamma_3 = \{(x, x\alpha),\ x \in B\} \]

fails to be strongly $c$-monotone. Indeed, we suppose that there are Borel-measurable
$\varphi_2$ and $\psi_2$ taking values in $\mathbb{R} \cup \{-\infty\}$ such that

\[ \varphi_2(x) + \psi_2(y) \le c(x, y) \tag{2} \]

for all $(x, y) \in T^1 \times T^1$, with equality holding for $x\alpha = y$ and $x \in B$. Using the
special form of $c$, we see that (2) implies

\[ \varphi_2(z) + \psi_2(z) \le 1, \tag{3} \]
\[ \varphi_2(x) + \psi_2(x\alpha) = 2, \tag{4} \]

for $x \in B$ and $z \in T^1$. Combining (3) and (4) we obtain

\[ \varphi_2(x) \ge \varphi_2(x\alpha) + 1 \tag{5} \]

for $x \in B$ or for $x\alpha \in B$. Defining $B_k := B \cap \varphi_2^{-1}([k, k+1[)$ for $k \in \mathbb{Z}$, we obtain
a partition $(B_k)_{k \in \mathbb{Z}}$ of $B$. We consider now an equivalence relation on $T^1$, namely
$x \sim_\alpha y$ if $x\alpha^m = y$ for some $m \in \mathbb{Z}$. Fix $k \in \mathbb{Z}$; due to (5) there can be at most one
element of each equivalence class $[x] \subset T^1$ in $B \cap \varphi_2^{-1}([k, k+1[)$. On the other hand
every Borel set $A \subset T^1$ which contains at most one element of each equivalence
class $[x]$ has measure $\mu(A) = 0$ (since then $\bigcup_{m \in \mathbb{Z}} (A\alpha^m) \subset T^1$, the sets $A\alpha^m$ are pairwise
disjoint and $\mu(A\alpha^m) = \mu(A)$ for $m \in \mathbb{Z}$ by translation invariance). Consequently
we obtain a contradiction to the assumption that the sets $B_k$ are Borel sets.

We can finally formulate an approximation result for the optimal transport plan
of Monge-Kantorovich problems. The result can also be interpreted as a discretization
result for numerical purposes. In particular, we obtain, without any moment
conditions, that the adherence points of candidate approximations are optimizers,
provided the cost function $c$ is continuous.

Definition 3. Let $c : X \times Y \to \mathbb{R}_{\ge 0} \cup \{+\infty\}$ be a lower semi-continuous cost
function on polish spaces $X, Y$, and let Borel probability measures $\mu$ on $X$ and $\nu$
on $Y$ be given. Assume that there are sequences $(\mu_n)_{n \ge 1}$ and $(\nu_n)_{n \ge 1}$ of Borel probability
measures converging weakly to $\mu$ and $\nu$ on the respective spaces $X$ and $Y$. We
denote by $\pi_n$ a solution of the Monge-Kantorovich problem associated to $(\mu_n, \nu_n, c)$
for $n \ge 1$ and we assume that

\[ \int_{X \times Y} c(x, y) \, \pi_n(dx, dy) < \infty \]

for each $n \ge 1$. We call the sequence $(\pi_n)_{n \ge 1}$ an approximating sequence of
optimizers.

Theorem 3. Let $c : X \times Y \to \mathbb{R}_{\ge 0}$ be a finitely valued, continuous cost function
on polish spaces $X, Y$. Let $(\pi_n)_{n \ge 1}$ be an approximating sequence of optimizers
associated to weakly converging sequences $\mu_n \to \mu$ and $\nu_n \to \nu$ as $n \to \infty$. Then
there is a subsequence $(\pi_{n_k})_{k \ge 1}$ converging weakly to a transport plan $\pi$ on $X \times Y$,
which optimizes the Monge-Kantorovich problem $(\mu, \nu, c)$. Any other converging
subsequence of $(\pi_n)_{n \ge 1}$ also converges to an optimizer of the Monge-Kantorovich
problem, i.e. the non-empty set of adherence points of $(\pi_n)_{n \ge 1}$ is a set of optimizers.

Proof. Fix $\varepsilon > 0$. By Prohorov's Theorem (see for instance [4], Th. 16.3) there are
compact sets $K \subset X$ and $L \subset Y$ such that $\mu_n(X \setminus K) < \varepsilon$ and $\nu_n(Y \setminus L) < \varepsilon$ for all
$n$. Hence $\pi_n(X \times Y \setminus K \times L) \le \pi_n(X \times (Y \setminus L)) + \pi_n((X \setminus K) \times Y) < 2\varepsilon$. Again
by Prohorov's Theorem we know that there is a weakly converging subsequence
$(\pi_{n_k})_{k \ge 1}$ with weak limit $\pi$. We have to prove that $\pi$ is $c$-monotone, since by
Proposition 1 the transport plan $\pi$ is then strongly $c$-monotone and hence optimal.
We know that for $m \ge 1$ and points $(x_i, y_i) \in \operatorname{supp} \pi$ for $i = 1, \dots, m$, there exist
sequences of points $(x_i^k, y_i^k)_{k \ge 1}$ such that $(x_i^k, y_i^k) \in \operatorname{supp} \pi_{n_k}$ and $(x_i^k, y_i^k) \to (x_i, y_i)$
as $k \to \infty$ (see again [4], Th. 16.3). Since $\pi_{n_k}$ is $c$-monotone by Proposition 2 and
Remark 3 and since $c$ is continuous, $\operatorname{supp} \pi_{n_k}$ is a $c$-monotone subset of $X \times Y$,
hence

\[ \sum_{i=1}^{m} c(x_i^k, y_i^k) \le \sum_{i=1}^{m} c(x_i^k, y_{\sigma(i)}^k) \]

for permutations $\sigma \in \mathcal{S}_m$, whence also for $k \to \infty$ by continuity of $c$. By the
same reasoning any converging subsequence of $(\pi_n)_{n \ge 1}$ converges to a $c$-monotone
transport plan, which is by Proposition 1 a strongly $c$-monotone one and hence
optimal by Theorem 2. □

Remark 4. In particular the set of optimizers of a Monge-Kantorovich problem is
compact in the weak topology on probability measures, since the constant sequences
$\mu_n = \mu$ and $\nu_n = \nu$ for $n \ge 1$ are approximating sequences. This also shows that
we cannot improve the result to an assertion on convergence of the whole sequence, but
we point out that the convergence of the sequence in Theorem 3 holds true if and
only if the optimizer is unique.

Example 2. Let $c$ be a continuous, finitely valued cost function on the product
$X \times Y$ of two polish spaces $X, Y$ and let $\mu, \nu$ be probability measures thereon. By
the law of large numbers we can always find empirical distributions (see for instance
[4, Prop. 4.24])

\[ \mu_n = \frac{1}{n} \sum_{i=1}^{n} \delta_{x_i}, \qquad \nu_n = \frac{1}{n} \sum_{i=1}^{n} \delta_{y_i}, \]

which converge weakly to $\mu$ and $\nu$, respectively. Each associated approximating
sequence of optimizers $(\pi_n)_{n \ge 1}$ is given through a sequence of permutations
$\sigma_n \in \mathcal{S}_n$, $n \ge 1$, and

\[ \pi_n = \frac{1}{n} \sum_{i=1}^{n} \delta_{(x_i, y_{\sigma_n(i)})}. \]

Notice that always $I(\pi_n) = \int_{X \times Y} c(x, y) \, \pi_n(dx, dy) < \infty$ for each fixed $n \in \mathbb{N}$. We
deduce from Theorem 3 that all adherence points of $(\pi_n)_{n \ge 1}$ are optimizers of the
Monge-Kantorovich problem and that there is at least one adherence point. Notice
that each $\pi_n$ for $n \ge 1$ is obtained by solving a finite optimization problem.
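
The finite optimization problem behind each $\pi_n$ is an assignment problem over permutations. As a minimal sketch, assuming a tiny sample and a quadratic cost chosen only for illustration, the following brute-force search finds an optimal permutation $\sigma_n$ and the corresponding value $I(\pi_n)$; the helper name `optimal_assignment` is hypothetical, and the factorial search is only feasible for very small $n$.

```python
from itertools import permutations

def optimal_assignment(xs, ys, cost):
    """Brute-force search for a permutation sigma minimising sum_i cost(xs[i], ys[sigma[i]])."""
    n = len(xs)

    def total(sigma):
        return sum(cost(xs[i], ys[sigma[i]]) for i in range(n))

    best = min(permutations(range(n)), key=total)
    # pi_n puts mass 1/n on each pair (xs[i], ys[best[i]]), so I(pi_n) = total / n.
    return best, total(best) / n

def quadratic(x, y):
    return (x - y) ** 2

# Illustrative sample points (assumptions, not taken from the paper).
xs = [0, 2, 1]
ys = [5, 1, 3]
sigma, value = optimal_assignment(xs, ys, quadratic)
print(sigma, value)
```

For the quadratic cost on the real line the search returns the order-preserving pairing, here $0 \mapsto 1$, $1 \mapsto 3$, $2 \mapsto 5$. For larger samples a dedicated assignment solver would replace the exhaustive search.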